Developing Speech Synthesis for Under-Resourced Languages by "Faking it": An Experiment with Somali
Abstract
Speech synthesis or text-to-speech (TTS) systems are currently available for a number of the world’s major languages, but for thousands of other, unsupported languages no such technology is available. While awaiting the development of such technology, we propose using an existing TTS system for a major language (the base language, BL) to “fake” TTS for an unsupported language (the target language, TL). This paper describes the factors which determine the choice of a suitable BL for a given TL, and describes an experiment with a fake Somali TTS system evaluated in the real-life situation of a doctor–patient dialogue. Twenty-eight Somali participants were asked to judge the comprehensibility of 25 short Somali sentences recorded with a German TTS system. Results suggest that “faking it” provides reasonable stop-gap TTS for unsupported languages.
Related resources
Home-made speech synthesis for non-English-speaking patients
This poster concerns the development of computer-based support for health-care seekers with limited English, focusing on speech synthesis and on languages for which such technology has not been developed. Speech synthesis (or text-to-speech – TTS) systems are available only for the world’s major languages. In the absence of such technology, we want as a stop-gap solution to use an existing TTS ...
Faking it: Synthetic Text-to-speech Synthesis for Under-resourced Languages - Experimental Design
Speech synthesis or text-to-speech (TTS) systems are currently available for a number of the world’s major languages, but for thousands of the world’s ‘minor’ languages no such technology is available. While awaiting the development of such technology, we would like to try the stop-gap solution of using an existing TTS system for a major language (the base language) to ‘fake’ TTS for a minor la...
Automatic Speech Recognition for Under-Resourced Languages:
Speech processing for under-resourced languages is an active field of research, which has experienced significant progress during the past decade. We propose, in this paper, a survey that focuses on automatic speech recognition (ASR) for these languages. The definition of under-resourced languages and the challenges associated to them are first defined. The main part of the paper is a literatur...
Towards automatic cross-lingual acoustic modelling applied to HMM-based speech synthesis for under-resourced languages
Nowadays Human Computer Interaction (HCI) can also be achieved with voice user interfaces (VUIs). To enable devices to communicate with humans by speech in the user’s own language, low-cost language portability is often discussed and analysed. One of the most time-consuming parts for the language-adaptation process of VUI-capable applications is the target-language speech-data acquisition. Such ...
The development of new corpora for under-resourced languages using data available for well-resourced ones
In the paper we propose to exploit existing corpora of well-resourced languages as a basis for developing similar corpora of under-resourced ones. The construction of this type of corpora will allow finding common patterns of acoustic manifestation of similar functional states regardless of the language. The analysis of these corpora will also allow investigating universal and language-specific ...